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Introductory 

Conspicuous  among  the  major  concerns  of  present-day  psy- 
chology is  the  problem  of  the  nature  of  mental  organization. 
This  term  centers  in  itself  a  variety  of  questions  such  as: 
What  constitutes  a  single  mental  function?  How  does  it  op- 
erate? Over  how  wide  a  range  of  material  does  a  certain  abil- 
ity extend?  What  is  the  connection  between  abilities  which  in- 
trospectively  or  objectively  appear  alike  or  unlike  as  the  case 
may  be? 

Couched  in  its  modern  form,  this  problem  first  received  ex- 
plicit formulation  in  1904  in  the  historic  paper  of  Spearman 
(22),  although  its  adumbrations  were  already  discernible  in 
the  writings  of  Wolff  and  later  "faculty"  psychologists.  In  this 
article  Spearman  presented  evidence  which  seemed  to  indicate 
the  presence  of  a  "general  factor"  in  test  performances.  This 
contention  occasioned  a  controversy  which  even  now  remains 
unsettled.  His  interpretation  did  not  meet  with  a  very  favor- 
able reception  in  America,  and  German  psychology  was  not  yet 
sufficiently  imbued  with  the  statistical  spirit  of  Betz  and  Lip- 
mann  to  foster  the  idea  (see  10,  p.  504-518).  In  France, 
Binet  was  occupied  with  the  construction  of  his  first  intelli- 
gence test,  which  Spearman  claims  (22a,  p.  60,  66,  71)  simply 
carried  into  practice  his  own  suggestion  of  promiscuous  pool- 
ing for  the  measurement  of  general  ability,  but  neglected  to 
make  clear  the  theoretical  foundations  of  the  work.  The  lack 
of  an  adequate  groundwork  of  "principles"  has  unquestionably 
caused  many  difficulties,  such  as  those  which  honeycomb  the 
numerous  symposia  on  the  nature  of  intelligence.  It  would 
seem  after  a  generation  of  polemics  that  Spearman  had  some- 
what the  better  of  the  argument,  for  recently  the  "particular- 
ists" — to  borrow  a  phrase  from  continental  history — appear 
to  have  lost  their  chief  protagonist  when  Thorndike  committed 
himself  to  the  statement,  "One  cherishes  the  hope  that  some 
simpler,  more  unitary  fact  exists  as  the  cause  of  intellect  and 
that  variations  in  the  magnitude  of  this  fact  may  provide  a 
single  fundamental  scale  which  will  account  for  levels  and 
range  and  surface."  (23,  p.  412).  A  complete  summary  of  the 
controversial   literature  with  a  digest  of  the  laborious  re- 
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searches  of  the  British  school,  and  an  outline  of  his  present 
position  will  be  found  in  Spearman's  latest  volume  (22a)  .* 

Most  of  the  investigations  in  this  field  are  foreign  to  our 
immediate  interest,  but  there  is  at  least  one  pertinent  theoret- 
ical aspect:  Is  any  attribute — speed,  memory,  honesty,  ac- 
curacy— a  general  characteristic  of  an  individual  or  a  partic- 
ular one?  Do  these  qualities  appear  uniformly  in  such  a 
manner  that  if  a  person  exhibits  them  in  one  situation  he  will 
also  manifest  them  to  a  similar  degree  in  other  tasks? 

The  answers  to  these  questions  bring  us  into  the  field  of 
"group  factors,"  a  name  which  has  been  employed  to  differ- 
entiate those  broad  capacities  which  are  intermediate  in  scope 
between  "general  ability"  or  "general  intelligence"  and  the 
myriad  specific  functions  acquired  in  a  life-time  of  learning. 
If  one  adopts  the  view  that  all  the  "bonds"  involved  in  learned 
behavior  are  specific,  then  the  very  concept  of  a  group  factor 
is  either  mythical  or  superfluous,  unless  it  be  interpreted  as 
the  product  of  an  overlapping  of  common  elements.  Recent 
students  of  personality,  however,  find  this  view  in  conflict  with 
other  theoretical  considerations.  If  an  individual  is  simply 
a  bundle  of  habits  without  any  recognizable  systematic  inter- 
lacing among  them,  then  we  cannot  say  that  because  a  man  is 
neat  in  the  care  of  his  person  he  will  be  neat  in  the  care  of  his 
room;  or,  if  he  is  honest  in  money  matters  that  he  will  be 
honest  in  reporting  events.  But  common  speech  does  single 
out  such  broader  aspects  of  behavior  and  label  them.  To  most 
understandings,  an  energetic  man  is  not  only  forceful  in  bear- 
ing and  movement,  but  highly  katabolic  in  all  forms  of  work. 
This  attitude  seems  to  be  represented  by  Courtis  (6)  who  re- 
jects the  specificity  theory  while  admitting  the  proved  fact  that 
each  new  instance  of  behavior  differs  from  all  others.  He 
states  his  objection  in  these  words:  "It  closes  the  door  to  dis- 
covery. It  is  as  if  the  early  chemists  had  said,  'Each  type  of 
substance  is  specific.  Sugar  differs  from  butter  and  each  from 
water.  Therefore,  the  concept  of  elements  in  untenable.'  " 
As  a  working  hypothesis,  he  suggests  the  tentative  adoption  of 
the  opposite  view.  "Each  instance  of  behavior  is  specific.  The 
vast  array  of  specific  behaviors,  however,  seem  to  have  certain 
characteristics  in  common.  A  search  must  be  made  for  tend- 
encies which  seem  to  operate  in  different  situations,  and  the 
isolation  of  their  factors  attempted."  (6,  p.  86.) 


*  An  excellent  briefer  account  is  given  by  Dodd  (8). 
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Precision  of  definition  will  be  of  service  here  as  in  most  in- 
stances, hence  we  propose  to  term  every  quality  which  appears 
to  be  a  general  characteristic  a  "trait,"  using  tha't  word  as  de- 
fined by  G.  W.  Allport  (1) — "a  higher  integrated  habit-system 
which  represents  an  individual's  characteristic  mode  of  ad- 
justment to  environmental  activities". 

The  special  problem  of  this  study  is  to  determine  whether 
accuracy  conforms  to  such  a  definition.  Is  accuracy  a  habit 
of  a  simple  order  which  remains  attached  to  the  specialized 
behavior  which  it  qualifies,  or  is  it  a  more  highly  organized 
ability  represented  in  diverse  functions;  and  to  what  extent? 
A  solution  of  this  question  is  attempted  in  the  following  pages. 


II.  The  Problem  and  Its  Background 

Before  we  can  determine  whether  accuracy  be  an  integrated 
way  of  behaving,  it  will  be  necessary  to  examine  those  situa- 
tions or  processes  wherein  it  occurs.  Sensory  discriminations 
exhibit  the  trait  in  its  most  elementary  form:  perception  of 
time-intervals,  judgments  of  weights,  estimates  of  distances. 
Motor  activities — like  target-aiming  and  the  coordination  de- 
manded in  mirror  drawing — are  the  second  great  field  from 
which  illustrations  of  accurate  performance  may  be  drawn. 
Finally,  "intellectual"  responses  such  as  following  directions 
and  solving  simple  problems  may  be  analyzed  with  special 
reference  to  this  type  of  behavior. 

Since  accuracy  is  found  in  such  a  variety  of  acts  it  may  be 
useful  to  attach  some  uniform  meaning  to  the  word.  It  is  pro- 
posed here  to  consider  accuracy  as  that  characteristic  of  meas- 
urable behavior  which  is  indicated  by  relative  freedom  from 
errors — an  error  being  considered  as  a  deviation  from  an  ob- 
jectively correct  standard.  This  definition  appears  adequate 
for  the  present  undertaking,  as  it  makes  accuracy  a  definite 
quality  of  any  test  product. 

These  preliminary  remarks  have  seemed  essential  to  the 
final  phrasing  of  the  subject  of  this  investigation:  In  a  given 
group,  does  the  proportion  of  errors  which  each  individual 
makes  remain  constant  for  him  (relative  to  the  group  distribu- 
tion) no  matter  what  performance  he  undertakes? 

A  need  for  some  such  study  seems  to  be  apparent  from  the 
paucity  of  experimental  work  in  which  this  factor  is  empha- 
sized. Speed  has  always  had  its  obvious  share  of  attention, 
and  the  field  of  difficulty  has  recently  been  carefully  analyzed 
(see  Thorndike  23),  but  the  third  factor  in  the  numerator  of 
Woodworth's  "power"  formula  (30a,  p.  11-14)  : 

Power  =  Quantity  X  Difficulty  X  Excellence  of  Result  -f-  Time 

has  been  relatively  neglected.  To  be  sure,  the  work  on  hand- 
writing, composition,  and  other  educational  scales,  appears  to 
indicate  the  contrary,  but  a  measurement  of  "excellence  of 
result"  in  the  form  of  accuracy  over  a  wide  range  of  capacities 
has  never  been  made  the  primary  task  of  an  investigation. 

Since  the  problem  in  this  form  seems  to  have  practically  no 
historical  antecedents,  despite  the  fact  that  wide  individual 
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differences  in  accuracy  are  obvious  in  all  test  results,  it  would 
be  well  to  inquire  just  what  related  literature  is  available. 
Wells  (26)  appears  to  have  been  the  first  to  point  out  the  pos- 
sibilities of  this  field.  He  constructed  a  battery  with  the  fol- 
lowing test  components:  1.  Addition  (of  one  constant  quan- 
tity to  twenty  two-place  numbers)  ;  2.  Long  Division  problems ; 
3.  Arrangement  of  numbers  in  order  of  size ;  4.  Alphabetizing 
of  ten  names;  5.  Copying  nine-place  numbers;  6.  Letter- 
square  substitution  of  symbols  (20,  p.  179).  "It  is  intended 
that  the  problems  of  this  test-series  shall  all  be  well  within  the 
range  of  the  subjects'  information.  The  quality  of  performance 
then  depends  in  the  first  instance  upon  the  care  with  which 
the  tasks  are  accomplished,  not  upon  special  information  of 
correct  answers  or  ability  to  reason  them  out.  In  subjects  for 
whom  the  series  is  suited,  performance  is  limited  essentially  by 
the  subject's  carefulness.  Other  test  series  are  limited  by 
knowledge  and  reasoning  ability." 

It  will  be  observed  that  Wells  has  pointed  out  a  condition 
which  all  tests  of  accuracy  must  meet — viz.,  adaptation  to  a 
homogeneous  group.  Since  the  material  employed  must  be 
within  the  range  of  difficulty  of  all  the  subjects,  one  must  have 
recourse  to  relatively  simple  tasks,  such  as  those  listed  above. 
When  this  prerequisite  is  satisfied,  differences  in  "altitude" 
cease  to  be  a  disturbing  factor,  even  though  they  cannot  be  said 
to  be  equalized.  It  removes  the  possibility  of  having  inac- 
curate performance  by  an  otherwise  "careful"  individual, 
whose  errors  are  primarily  attributable  to  his  attempt  to  work 
beyond  his  level. 

The  early  work  of  Krueger  and  Spearman  (16)  employed  a 
technique  which  has  now  become  conventionalized  for  studies 
in  "factorial"  psychology.  To  eleven  subjects  these  authors 
gave  five  tests — pitch  discrimination,  spatial  thresholds,  add- 
ing, memory  span,  and  a  completion  test — and  computed  the 
intercorrelations.  The  now  supplanted  criterion  of  the 
hierarchical  arrangement  of  the  coefficients  was  used  to  dem- 
onstrate the  existence  of  a  "general  factor"  in  all  these  tests. 
In  the  case  of  the  adding  tests,  separate  correlations  between 
them  when  scored  for  speed  and  accuracy  independently  and 
the  other  tests  were  obtained.  The  authors'  justification  for  this 
was  that  they  were  compelled  to  disregard  "Geschwindigkeit" 
while  studing  "Genauigkeit"  on  account  of  technical  difficulties 
(wegen  technischer  Schwierigkeiten) .  (16,  p.  91.) 
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A  later  paper  of  Thorndike,  Lay,  and  Dean  (23a)  on  the 
"Relation  of  Accuracy  in  Sensory  Discrimination  to  General 
Intelligence"  had  approached  the  problem  with  a  different 
objective.  These  authors  took  the  A.D.  from  the  standard  as  a 
measure  of  inaccuracy  in  the  case  of  tests  with  lines  and 
weights,  and  correlated  the  obtained  values  with  intellect  as 
judged  by  teachers  and  fellow-students.  These  measures  were 
simply  samples  of  two  mental  fields  presumably  at  opposite 
poles  of  complexity.  A  correlation  of  .23  was  interpreted  to 
imply  the  relative  independence  of  the  two  functions,  and  used 
as  a  refutation  of  Spearman's  original  contention.  What  would 
have  been  of  interest  was  the  correlation  between  the  ranks  of 
the  subjects  for  linear  discrimination  and  the  magnitude  of 
their  thresholds  for  lifted  weights.  Unfortunately,  this  was 
not  presented. 

The  same  general  topic  was  studied  in  the  midst  of  the 
"formal  discipline"  era  by  Winch  (28).  Asserting  that  peda- 
gogs  aim  to  inculcate  a  habit  of  accuracy  in  schoolwork  in  the 
hope  that  it  will  later  operate  in  life-situations,  he  asked.  Will 
training  in  accurate  numerical  computation  transfer  to  arith- 
metical reasoning?  By  the  use  of  the  method  of  equivalent 
groups  he  came  to  the  conclusion  that  "It  seems  possible  to  im- 
prove the  accuracy  of  numerical  computation  without  any  cer- 
tainty that  we  shall  thereby  improve  the  accuracy  of  arith- 
metical reasoning."  A  later  study  of  his  (28a)  appeared  to 
confirm  this  view  which  he  phrases  as  follows :  "There  exists 
a  high  positive  correlation  of  .74  between  the  two  functions, 
numerical  computation  and  arithmetical  reasoning.  This  high 
correlation  does  not  appear  to  involve  such  a  community  of 
function  that  improvement  in  one  function  involves  improve- 
ment in  the  other." 

Another  study  whose  present  relationship  is  rather  indirect 
was  made  by  Trow  (24)  on  the  general  factor  of  confidence. 
He  derived  a  correlation  of  .03  for  all  subjects  between 
achievement  scores  (correctness  of  judgment)  and  confidence 
scores  (subjective  certainty)  on  the  same  tests.  Variability 
and  specificity  of  the  confidence  trait  were  pronounced.  "Any 
test  of  confidence  which  does  not  cover  a  wide  number  of  situa- 
tions is  futile  as  an  indicator,  and  its  predictions  will  as  likely 
be  wrong  as  right.  It  seems  possible  that  this  is  also  true  of 
other  character  traits.  The  traits  are  not  constants,  but  vary 
with  the  varying  situations." 
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Among  the  numerous  works  on  the  relation  of  accuracy  to 
speed,  that  of  Garrett  (12)  may  be  mentioned  as  representa- 
tive. The  existence  of  an  optimum  rate  of  lifting  weights  was 
established,  and  there  seemed  to  be  evidence  that  men  who 
ranked  high  in  one  rate  tended  to  hold  their  relative  position 
in  others.  A  certain  narrow  type  of  consistency  appears  to 
obtain  in  this  process. 

Accuracy  has  always  been  considered  one  of  the  best  indices 
of  mental  efficiency  and  fluctuations  in  this  trait  have  loomed 
large  in  the  researches  of  pharmaceutical  psychology.  Typical 
of  this  field  is  the  study  of  Hull  (14)  on  the  effect  of  tobacco 
smoking  on  the  accuracy  of  cancelling  A's  and  mental  addition. 

Suggestions  as  to  the  technique  to  be  employed  in  the  solu- 
tion of  our  group  factor  type  of  problem  have  been  derived 
from  various  sources,  such  as  Hargreaves'  (13)  study  of  the 
imagination,  Bernstein's  (3)  work  on  quickness,  Davey's  (7) 
on  "pictoriality,"  the  dissertation  of  Brown  (4)  on  caution, 
and  that  of  Dowd  (9)  on  the  consistency  of  the  rate  of  work. 
In  all  these  cases  a  definite  attribute  of  a  test  product  was  ex- 
amined for  its  degree  of  "spread"  or  generality,  but  the  results 
uniformly  indicate  that  none  of  these  traits  are  "broad"  func- 
tions. For  example,  Davey  found  that  a  verbal  mental  test 
measures  the  same  general  factor  "g"  as  does  a  test  similar  in 
form  but  non-verbal  in  material,  such  as  pictures,  from  which 
she  concludes  that  there  is  no  group  factor  of  "pictoriality." 

The  investigations  of  Bernstein  and  Dowd  are  of  especial  in- 
terest because  both  were  unable  to  discover  any  evidence  for 
a  general  speed  factor.  Bernstein  found  no  significant  difference 
between  the  correlations  of  his  "leisure"  and  "haste"  tests 
with  a  criterion  of  intelligence;  this  was  confirmed  by  Dowd 
who  showed  that  the  intercorrelations  for  tests  of  rate  of  work 
for  dissimilar  performances  were  very  low. 

Positive  results  in  the  attempted  isolation  of  new  group 
factors  have  been  achieved  by  a  number  of  Spearman's  pupils. 
We  may  mention  the  work  of  Carey  (5)  on  the  nature  of 
specific  mental  factors.  He  was  unable  to  unearth  a  discrimi- 
natory function  apart  from  "g,"  but  he  did  seem  to  find  evidence 
for  the  existence  of  a  small  general  memory  factor.  Webb's 
(25)  study  is  especially  noteworthy  as  he  apparently  demon- 
strated the  existence  of  a  new  general  factor  of  "will"  (i.e., 
persistence  of  motives).  This  trait  functions  in  the  conative 
sphere  in  much  the  same  way  that  "g"  operates  in  the  cognitive 
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life.  The  factor  of  perseveration,  or  persistence  of  impres- 
sions, appears  to  have  been  established  by  the  experiments  of 
Lankes  (17),  and  the  elaborate  mathematical  analysis  of 
Webb's  data  made  by  Garnett  (11)  disclosed  a  new  magnitude 
termed  "cleverness,"  a  trait  which  is  considered  to  be  the 
obverse  of  perseveration.  An  incidental  result  of  all  these  in- 
vestigations is  that  the  methodics  of  determining  generality 
have  become  pretty  well  standardized,  thereby  facilitating 
comparison  of  data. 


III.  The  Tests  and  Procedure 

1.  Methodological  Requirements. 

To  discover  whether  accuracy  be  a  group  factor  distin- 
guished by  functional  unity  we  must  first  meet  the  require- 
ment of  having  a  comparatively  large  number  of  tests.  The 
justification  of  this  demand  is  that  no  one  measure  of  any 
attribute  is  an  adequate  index  of  how  much  an  individual 
possesses  of  it.  This  is  almost  axiomatic  and  follows  from  the 
principle  that  any  measurement  of  ability  is  in  essence  an  in- 
ventory; consequently,  the  greater  the  variety  of  tasks  pre- 
sented, the  greater  will  be  the  probability  that  the  obtained 
magnitudes  will  be  fair  representatives  of  the  true  ones. 

Another  desideratum  to  be  fulfilled  is  that  the  size  of  the 
group  employed  be  large  enough  to  serve  as  an  adequate 
sample  of  the  particular  population  from  which  it  is  drawn. 
Failure  to  work  with  reasonably  large  groups  is  a  common 
defect  of  many  of  the  early  studies  in  this  field,  both  on  the 
part  of  Spearman's  disciples  and  his  opponents.  In  this  in- 
vestigation eighty-nine  Columbia  University  undergraduates, 
all  men,  were  used  as  subjects  for  the  experiments  to  be  de- 
scribed. They  were  all  of  sophomore  rank  or  beyond,  students 
in  classes  of  elementary  psychology,  and  each  man's  presence 
in  the  experimental  program  was  purely  voluntary.  A  number 
of  the  records  were  incomplete  or  unserviceable  because  of 
technical  errors,  so  that  the  final  number  was  reduced  to 
eighty-one.  The  testing  period  for  each  man  was  about  two 
and  one-half  hours,  a  little  over  an  hour  being  spent  on  two 
separate  days. 

All  the  tests  were  administered  individually.  This  would 
have  been  inevitable  with  some  of  the  tests,  but  it  was  decided 
to  extend  this  procedure  to  every  test,  partly  for  the  sake  of 
uniformity  and  partly  to  exclude  the  influence  of  social  stimuli 
upon  work  done  under  competition.  This  precaution  was  sug- 
gested by  the  results  of  experiments  by  Mayer,  Schmidt,  Meu- 
mann,  and  Allport,  which  indicate  that  "stimuli  from  the 
group  have  a  favorable  effect  upon  the  amount  but  not  upon 
the  quality  of  work."  (2,  p.  269.) 

2.  Selection,  Description,  and  Administration  of  the  Tests 
The  choice  of  the  tests  to  be  used  was  the  first  major  diffi- 
culty encountered  in  the  course  of  this  investigation.    If  we 
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cannot  even  be  certain  that  a  completion  test  is  a  measure  of 
reasoning  ability,  then  it  is  clear  that  simply  calling  a  test  a 
"test  of  accuracy"  does  not  make  it  such.  But  this  does  not 
prohibit  an  attempt  to  see  whether  accuracy  defined  as  rela- 
tive freedom  from  errors  in  different  test-situations — regard- 
less of  the  major  ability  involved — ^behaves  as  a  unit  factor 
or  not. 

More  than  a  score  of  tests  were  considered  for  this  purpose. 
The  ones  finally  selected  appeared  to  the  experimenter  to  give 
the  maximum  diversification  of  material  available  within  the 
restricted  time-limits.  Among  the  tests  originally  proposed, 
but  ultimately  eliminated,  were:  simple  letter  cancellation, 
target-aiming,  McDougall  dotting  test,  the  tracing  board, 
mirror-dra\ving,  estimates  of  time-intervals,  jumbled  para- 
graphs, the  I-T  test  (see  Hargreaves,  13),  judgments  of  hand- 
writing, from  a  series  of  pairs  of  digits  marking  those  pairs 
where  the  difference  between  the  digits  equalled  three,  and 
the  Thurstone  Clerical  Test.  The  reasons  for  elimination  va- 
ried, but  the  desire  to  avoid  duplication  and  to  insure  ease  and 
simplicity  of  administration  were  the  chief  motives.  The 
tests  remaining  seemed  to  possess  most  of  the  desirable  fea- 
tures of  the  rejected  tests,  and  to  offer  the  possibility  of  meas- 
uring the  greatest  number  of  typical  abilities  in  which  accu- 
racy plays  a  part. 

Records  were  obtained  from  each  subject  for  the  eleven 
tests  listed  below.  The  number  and  order  of  the  tests  de- 
scribed in  the  following  paragraphs  will  be  constant  through- 
out this  report. 

(1)  Discrimination  of  lengths.  The  Galton  bar  and  acces- 
sories were  placed  in  uniform  diffuse  illumination.  The  sub- 
ject was  seated  with  his  back  to  the  light,  about  five  feet  before 
the  bar,  and  made  his  adjustments  by  turning  the  attached 
rod.  He  was  kept  in  ignorance  of  the  size  of  the  standard 
being  used,  and  was  instructed  to  make  his  adjustments  rap- 
idly and  with  as  little  hesitation  as  possible.  After  each 
adjustment  the  experimenter  recorded  the  reading  to  the 
nearest  millimeter  on  a  specially  prepared  sheet.  First  a 
ten-centimeter  standard  was  set  to  the  left  of  the  central 
division  point  on  the  Galton  bar;  the  subject  was  directed  to 
lay  off  an  equivalent  distance  to  the  right.  Ten  trials  were 
taken  from  the  center  outwards  and  ten  in  the  reverse  direc- 
tion, making  twenty  readings  for  this  standard.     A  similar 
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record  was  secured  with  the  standard  distance  set  at  twenty 
centimeters. 

(2)  Discrimination  of  brightness.  The  Woodworth  photom- 
eter containing  a  four  candle-power  stimulus  light  on  a  slid- 
ing lever  was  employed  in  a  dark  room  with  subjects  who  had 
been  dark-adapted  for  about  twenty  minutes  before  the  series 
was  begun.  The  subject  sat  with  his  eyes  on  a  level  with  the 
viewing-tube  of  the  instrument  and  about  two  inches  from  it. 
The  pointer  was  set  at  zero,  so  that  the  two  windows  were 
equally  illuminated.  The  experimenter  then  began  to  move 
the  bar  slowly  in  the  box;  the  subject  who  had  been  directed 
to  consider  the  left  window  as  the  standard  reported  as  soon 
as  he  believed  the  right  window  to  have  become  just  notice- 
ably darker  than  the  left.  The  pointer  reading  in  millimeters 
was  recorded  under  "increasing  differences".  Beginning  at  a 
point  where  the  right  side  was  definitely  darker  than  the  left, 
the  motion  of  the  bar  was  reversed  until  the  subject  reported 
the  right  window  equal  to  the  left.  This  record  was  entered 
under  "decreasing  differences".  Corresponding  determina- 
tions were  made  on  the  opposite  side  of  the  zero  mark.  The 
entire  cycle  of  four  readings  was  performed  five  times,  mak- 
ing twenty  readings  in  all.  A  similar  procedure  was  under- 
taken with  an  eight  candle-power  lamp. 

(3)  Pitch  discrimination.  The  Seashore  phonograph  rec- 
ord was  used  in  this  connection,  and  administered  in  accord- 
ance with  the  manual  of  instructions  issued  by  the  manufac- 
turers. 

(4)  Discrimination  of  intensity  of  sound.  Here,  too,  the 
Seashore  record  seemed  to  be  appropriate  to  our  purposes. 

(5)  Discrimination  of  lifted  weights.  The  usual  revolving 
table  weights  of  the  laboratory  appeared  too  cumbersome  for 
this  diagnosis,  and  recourse  was  had  to  the  lighter  cartridge 
weights  and  the  method  suggested  by  Whipple.  His  recom- 
mended procedure  involves  the  use  of  an  eighty-gram  standard 
and  twenty-three  heavier  comparison  weights.  The  pro- 
cedure given  in  his  text  (27,  I,  p.  225)  was  followed  to  the 
letter.  This  method  is  relatively  crude,  but  our  justification 
for  using  it  is  that  the  determination  of  an  exact  difference 
limen  with  a  minute  and  elaborate  technique  is  quite  a  dif- 
ferent process  from  this  simple  determination  of  functional 
capacity  for  comparative  purposes.  In  our  experiment  two 
differential    thresholds    were    obtained    independently — one . 
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around  the  eighty-gram  weight  as  a  standard  and  the  othei| 
around  ninety  grams. 

(6)  Attention.  This  is  a  hidden  word  test  taken  froinj 
W.  S.  Foster's  "Notes  for  Instructors  to  Accompany  Experi- 
ments in  Psychology,"  p.  9-11  (see  Appendix  I).  With  a  time 
limit  of  two  minutes,  the  subject  was  directed  to  underline i 
all  common  English  words  of  two,  three,  or  four  letters  to  be 
found  in  paragraphs  of  pied  material.  A  cross  was  made  at  I 
the  point  reached  in  the  text  when  the  end  of  the  period  was 
announced  by  the  experimenter.  An  alternate  form  was  given 
so  that  the  test's  reliability  would  be  available. 

(7)  Recognition.  This  is  a  stock  memory  experiment  of 
the  manuals.  The  subject  was  shown  twenty  words  printed 
on  cards  at  the  rate  of  about  one  item  per  second ;  these  were 
then  mixed  with  twenty  confusion  stimuli,  and  the  shuffled 
pack  returned  to  the  observer  to  be  sorted  into  two  piles — one 
containing  those  cards  he  believes  he  saw  and  another  con- 
taining those  he  did  not  see.  He  was  permitted  to  make  a 
third  pile  for  doubtful  cases,  but  subjects  rarely  availed  them- 
selves of  this  privilege,  since  it  was  explained  to  them  that  it 
was  not  to  their  advantage  to  do  so.  A  second  set  with  differ- 
ent words  was  given  as  a  check  on  the  first  (see  Appendix  II) . 

(8)  Aussage.  This  is  the  standard  picture  report  or  tes- 
timony experiment  recommended  by  Whipple  (27,  II,  p.  27). 
The  observer  was  shown  the  lithograph  called  the  "Austra- 
lians" for  twenty  seconds  and  then  handed  the  questionnaire 
given  on  page  29  of  the  preceding  reference.  Subjects  were 
told  to  answer  as  many  items  as  they  could,  but  freedom  to 
guess  or  omit  answers  was  accorded.  In  addition,  the  litho- 
graph entitled  "The  Disputed  Case,"  a  more  difficult  picture 
for  the  perception  of  details,  was  given.  Following  the  pres- 
entation, the  questionnaire  from  pages  27-28  of  Whipple's 
text  was  filled  in  at  the  subject's  leisure. 

(9)  Perseveration.  Adoption  of  this  test  was  suggested 
by  Bernstein's  (3,  p.  12)  use  of  it  in  another  context.  It  con- 
sists of  an  irregular  pattern  of  lines.  For  the  first  trial  the 
subject  was  required  to  reproduce  as  many  sections  as  he 
could  in  a  minute.  The  pattern  was  again  returned  to  the  sub- 
ject, this  time  with  changed  instructions  as  follows:  "Copy 
this  pattern  in  the  spaces  provided  below  in  accordance  with 
the  following  scheme:  Convert  every  horizontal  line  into  a 
vertical,  and  every  vertical  into  a  horizontal,  retaining  the 
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obliques  unmodified."  Obviously  interference  effects  are  pro- 
nounced in  this  set,  so  the  time  was  extended  to  two  minutes. 
When  half  of  the  interval  had  expired,  it  was  announced  by 
the  experimenter  and  the  subject  made  a  mark  in  his  copy. 
This  permitted  the  calculation  of  the  reliability  coefficient. 

(10)  Wells  test-battery.  This  has  already  been  described 
on  page  9.  No  further  instructions  were  added  to  those  given 
on  the  blank,  save  that  the  subjects  were  told  at  the  outset  to 
work  as  rapidly  and  as  correctly  as  possible. 

(11)  Otis  directions  test.  The  first  four  elements  in  Test 
I  of  the  Otis  Advanced  Group  Examination  were  eliminated  as 
too  easy,  leaving  sixteen  items  to  be  employed  here.  It  seemed 
fair  to  consider  errors  in  this  task  as  due  primarily  to  care- 
lessness, since  Test  I  of  Army  Alpha,  a  similar  series,  rarely 
yields  any  mistakes  with  persons  of  college  level. 

It  will  be  noticed  at  once  that  the  eleven  tests  fall  into  two 
broad  classes:  the  first  five  are  really  measures  of  sensory 
discrimination,  and  as  such  might  be  called  "precision"  tests, 
because  they  are  an  index  of  the  general  goodness  of  the  af- 
ferent nervous  system,  showing  the  limits  of  its  capacity ;  the 
last  six  instead  demand  higher  perceptual  processes  for  their 
execution,  and  may  be  called  "accuracy"  tests  by  contrast,  as 
they  measure  the  usual  or  customary  level  of  correctness.  The 
first  group  of  tests  shows  how  accurately  a  man  can  work ;  the 
second  group  indicates  how  accurately  he  does  work. 

3.  Scoring  the  Tests 

The  scoring  technique  employed  in  this  study  was  especially 
selected  to  accord  with  the  nature  of  our  problem,  and  is  based 
upon  certain  assumptions  which  will  demand  consideration 
in  the  interpretation  of  the  final  results.  The  procedure 
adopted  for  each  test  follows : 

(1)  In  the  case  of  the  Galton  bar,  the  crude  average  error 
of  each  individual — i.e.,  the  average  deviation  of  all  settings 
from  the  true  setting — was  computed  for  both  standards.  Pre- 
sumably the  smaller  this  value  the  more  precise  the  person 
who  gave  it. 

(2)  For  the  photometer  the  average  of  all  twenty  readings 
yielded  the  just  noticeable  difference. 

(3)  The  subject's  judgment  of  pitch  was  graded  according 
to  the  key  furnished  by  Seashore.    The  largest  number  of  cor- 
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rect  responses  distinguished  the  most  capable  man  in  this 
test. 

(4)  Intensity  was  scored  in  the  same  manner  as  pitch. 

(5)  Determination  of  the  limen  for  lifted  weights,  as  usual 
in  a  short  series,  was  indirect.  The  number  right — e.g.,  6,  7, 
8,  and  9 — was  found  for  each  comparison  weight,  and  the 
75%  threshold  for  each  computed  by  means  of  the  FuUerton- 
Cattell  table.  These  derived  values  for  each  comparison 
weight  were  then  averaged  to  give  the  final  relative  threshold. 

(6)  The  attention  test  was  scored  on  the  basis  of  the  per- 
centage which  the  correctly  underlined  words  were  of  the  total 
number  attempted.  This  was  easily  obtained  as  the  subject 
had  made  a  cross  under  the  letter  he  was  examining  when 
time  was  called  on  him.  It  was  assumed  that  where  this  pro- 
portion was  high,  the  greater  must  have  been  the  degree  of 
care  exerted  to  attain  that  result. 

(7)  The  individual's  ability  in  recognition  was  obtained  by 
taking  twice  the  number  of  misplaced  cards  in  each  pile  and 
subtracting  from  the  total  in  both  piles.  To  illustrate:  One 
subject  had  22  cards  in  his  "Seen"  group  and  18  in  his  "Not 
Seen"  group,  with  6  errors  in  the  former  and  4  in  the  latter. 
His  score  was  40  —  10  (2)  =  20  right;  or  50%.  The  few  in- 
dividuals who  made  a  "Doubtful"  pile  were  ranked  by  adding 
twice  the  number  of  errors  to  the  doubtful  cases,  and  then 
subtracting  this  total  from  40. 

(8)  The  "Aussage"  tests  were  scored  by  finding  each  per- 
son's percentage  of  right  answers  out  of  the  total  number  of 
questions  attempted.  This  amounts  to  a  disregard  of  the 
extent  of  a  man's  testimony  and  to  limiting  comparison  to  the 
fidelity  of  the  report  he  wishes  to  give.  The  maximum  score 
is  20  possible  right  items  for  the  Australians,  and  60  possible 
for  the  Disputed  Case. 

(9)  The  perseveration  test  was  not  marked  after  the  man- 
ner of  Bernstein  (3,  p.  12),  but  for  the  sake  of  uniformity  in 
our  procedure  the  per  cent  right  of  those  attempted  was  ob- 
tained for  the  second  trial  only.  An  error  consisted  in  any 
line  omitted  or  drawn  in  a  different  direction  from  that  re- 
quired by  the  model  and  the  instructions. 

(10)  In;  the  Wells  test  the  amount  done  was  constant  for 
each  subject.  By  considering  the  two  division  problems  as 
equivalent  to  five  units  each,  the  total  possible  number  of  items 
becomes  66.    The  per  cent  right  was  also  secured  here. 
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(11)  The  Otis  test  was  another  amount-limit  one.  The  key 
furnished  by  the  author  was  used,  leading  to  a  score  in  terms 
of  per  cent  right. 

The  alternate  forms  in  tests  6,  7,  8,  and  10  were  combined 
by  averaging  the  individual's  per  cent  right  in  both.  It  is 
this  joint  measure  which  was  later  employed  in  the  intercorre- 
lation  of  the  tests. 

4.  Sources  of  Error  in  the  Experimental  Conditions 

A  legitimate  objection  to  the  scoring  procedure  used  in  tests 
6,  9,  10,  and  11  would  be  that  it  is  arbitrary  to  disregard  the 
speed  with  which  a  subject  works,  because  one  man  may  do 
only  ten  correct  items  of  twenty  attempted  and  another  thirty 
out  of  sixty,  and  yet  both  receive  a  50%  accuracy  standing. 
The  only  way  to  remove  this  objection  would  be  to  weight  the 
accuracy  of  the  performance  by  the  rate  of  work.  This,  how- 
ever, introduces  a  total  efficiency  score,  which  was  considered 
irrelevant  to  our  purpose. 

The  first  five  tests  in  our  series  seem  to  be  free  from  this 
speed  complication.  In  tests  1  and  2,  the  rate  at  which  the 
subjects  worked  was  regulated  by  their  own  convenience,  and 
in  tests  3,  4,  and  5,  they  simply  had  to  work  in  rhythm  with 
the  stimuli. 

The  "accuracy"  tests,  however,  are  not  so  easily  disposed  of, 
because  the  scores  obtained  by  the  method  used  in  the  pre- 
ceding section  are  probably  affected  by  both  speed  and  diffi- 
culty elements.  Theoretically,  the  difficulty  is  not  the  same  for 
every  subject,  because  of  differences  in  practice,  interest,  etc. 
However,  it  is  hard  to  see  how  this  can  vary  to  any  great  extent 
with  the  simple  tasks  here  presented  to  a  selected  body  of  col- 
lege students.  To  be  sure,  it  is  more  difficult  for  some  men  to 
be  accurate  than  others.  A  discussion  of  the  presumptive 
effects  of  differences  in  intellectual  level  as  such  is  postponed 
to  Section  7  of  Chapter  IV. 

There  is  no  speed  element  present  in  tests  7  and  8 ;  in  tests 
10  and  11,  the  subjects  under  identical  instructions  chose  their 
own  rates  of  work.  The  resulting  accuracy  scores,  however, 
are  just  as  indicative  of  carefulness,  even  if  they  are  the  re- 
sult of  varying  speeds.  Diagnostically,  then,  there  can  be  no 
objection  to  a  comparison  made  from  these  figures. 

The  speed  factor  present  in  tests  6  and  9  cannot  be  ex- 
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plained  away  as  in  the  other  instances.  Special  treatment  is 
accorded  this  topic  in  Section  8  of  Chapter  IV. 

Certainly  the  subjects  of  the  experiment  were  completely 
unaware  of  the  objective  of  the  testing  program.  They  were 
informed  that  this  was  a  study  designed  to  see  whether  their 
ability  in  one  test  was  accompanied  by  ability  in  another  field, 
and  most  of  them  seemed  to  be  well  satisfied  with  this  gen- 
eral explanation.  No  special  set  toward  either  speed  or  ac- 
curacy was  given  by  the  instructions  preceding  each  "paper 
and  pencil"  test. 

It  may  further  be  remarked  that  the  retention  of  the  space 
error  in  the  case  of  the  Galton  bar  and  the  photometer  in  no 
way  affects  the  course  of  this  problem.  Elimination  of  the 
space  error  would  have  been  essential  in  a  study  of  absolute 
limens,  but  where  relative  thresholds  are  our  objective,  as  in 
the  present  instance,  the  uniformity  of  the  experimental  con- 
ditions assures  valid  comparisons  of  individuals. 


IV.     PRESENTATION  AND  DISCUSSION  OF  RESULTS 

1.  The  Means  and  Standard  Deviations 

Table  I  summarizes  the  original  data  for  all  eleven  tests. 
The  numerical  values  in  all  cases  represent  the  group's  aver- 
age per  cent  right  of  the  items  attempted,  except  for  tests  1, 

2,  and  5  which  are  given  in  threshold  values.    The  records  are 
complete  throughout  for  eighty-one  subjects. 

Through  the  files  of  the  University  Admissions  office*,  ac- 
cess was  had  to  the  Thorndike  scores  of  the  students  with  a 
view  to  determining  the  representativeness  of  the  experi- 
mental group  of  the  Columbia  College  population.  Unfortu- 
nately, only  sixty-eight  records  were  available,  but  these  gave 
an  average  Thorndike  score  of  82.15,  S.  D.  13.29.  This  is  not 
significantly  different  from  the  figures  given  by  Wood  (29,  p. 
51) .  Apparently,  then,  the  subjects  who  voluntarily  presented 
themselves  for  experimentation  were  a  fairly  random  sample 
of  their  class — at  least  as  far  as  intelligence  is  concerned. 

2.  The  Reliability  Coefficients  of  the  Tests 

Reliability  coefficients  for  tests  1  to  5,  inclusive,  were  found 
by  correlating  the  magnitudes  of  the  first  half  of  each  test  with 
the  values  of  the  second  half,  and  correcting  the  resulting 
coefficient  by  the  Spearman-Brown  prophecy  formula  for  a 

TABLE   I 

Means,  Standard  Deviations,  and  Errors  of  Measurement 
(in  terms  of  per  cent  right  unless  otherwise  indicated) 


Test 

Mean 

S.D. 

S.D.(m) 

S.D.(m) 
Average 

1. 

Lengths    (in   cms. 

.) 

9.07 

4.91 

.0982 

.0108 

2. 

Brightnesses    (in 

cms.) 

5.16 

1.23 

.0369 

.0072 

3. 

Pitch 

77.19 

10.82 

3.2460 

.0421 

4. 

Intensity 

89.61 

5.75 

2.7600 

.0308 

5. 

Weights  (in  gms.) 

1 

9.35 

3.54 

2.7258 

.2915 

6. 

Attention 

62.56 

13.91 

6.8159 

.1089 

7. 

Recognition 

68.12 

15.28 

9.7792 

.1436 

8. 

Aussage 

63.92 

8.90 

7.8320 

.1225 

9. 

Perseveration 

55.74 

20.95 

7.7515 

.1391 

10. 

Wells 

87.01 

12.71 

8.3886 

.0964 

11. 

Otis 

79.66 

11.38 

8.8764 

.1114 

Tests  1,  2, 

and  5 

are  given  in 

threshold  values. 

*  The  writer  is  under  obligation  to  Director  A.  L.  Jones  for  his  as- 
sistance in  this  regard. 
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test  of  double  length.  In  tests  6,  7,  and  10  the  reliability 
was  derived  by  the  administration  of  two  alternate  and  pre- 
sumably equivalent  forms  of  the  tests.  Test  9  yielded  its  reli- 
ability by  means  of  the  split-half  technique,  and  in  test  11  the 
odd  items  were  correlated  with  the  even.  Test  8 — ^the  Aussage 
— was  treated  in  a  slightly  different  fashion,  since  it  was  de- 
sired to  combine  the  results  on  both  pictures  into  a  joint  meas- 
ure of  fidelity  of  report.  As  the  questionnaires  were  of  un- 
equal length — ^the  Australians  containing  only  20  items  and 
the  Disputed  Case  60 — the  per  cent  right  in  the  top  half  of 
each  was  added  to  the  per  cent  right  in  the  bottom  half  of  the 
other,  from  which  the  self-correlation  was  obtained.  In  all 
instances  the  Spearman-Brown  prophecy  formula  was  used. 

The  reliability  coefficients  thus  secured  will  be  found  in 
Table  III,  where  they  lie  along  the  diagonal  of  italicized  fig- 
ures. It  will  be  observed  that  the  tests  of  a  psycho-physical 
nature  are  in  general  superior  in  self -consistency  to  the  more 
complex  perceptual  variety.  Many  of  the  reliabilities  are  dis- 
appointingly low,  particularly  the  Aussage,  the  Otis  Direc- 
tions, and  the  Lifted  Weights.  The  unreliability  of  the  Otis 
directions  test  is  surprising,  in  view  of  the  fact  that  it  is  an 
integral  part  of  an  accepted  standard  measure  of  intelligence. 
Inspection  of  the  frequency  of  errors  in  the  different  items  of 
that  test  seemed  to  indicate  that  the  even  elements  caused 
more  mistakes  than  the  odd ;  this  hypothesis  was  tested  by  cor- 
relating pairs  of  items  instead  of  single  items,  i.e.,  items  1-2, 
5-6,  9-10,  and  13-14  were  matched  for  errors  against  items 
3-4,  7-8,  11-12,  and  15-16.  Even  this  procedure  failed  to  raise 
the  reliability  coefficient  originally  obtained. f 

The  reliabilities  of  tests  1,  2,  and  5  shown  in  Table  III  are 
all  for  the  second  or  greater  standard.  The  low  self -correla- 
tion of  the  weights  is  regrettable,  but  to  have  raised  it  by  add- 
ing judgments  would  not  have  been  feasible  as  this  test  is  the 
most  time-consuming  of  all  in  the  program  for  the  subject. 

The  lowest  reliability  coefficient  attaches  to  the  picture  re- 
port test,  and  one  is  at  a  loss  to  account  for  this.  It  is  con- 
ceivable, however,  that  this  test — and  some  of  the  others  with 
reliabilities  not  up  to  the  accepted  standard  of  .80  or  .90 — 
may  be  perfectly  valid,  but  that  the  subjects'  own  variability 


t  A  personal  communication  from  Dr.  Otis  suggests  that  the  homo- 
geneity of  our  experimental  group  may  be  responsible  for  the  low  re- 
liability coefficient  here  presented. 
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in  accuracy  of  performance  is  so  great  as  to  occur  within  the 
compass  of  a  single  test-situation.  Unfortunately,  the  two 
facts  cannot  be  isolated  from  the  reliability  coefficient  alone. 
It  is  of  some  importance  to  know  whether  the  scale  of  meas- 
urement used  in  this  study  is  fine  enough  to  reveal  slight  dif- 
ferences in  accuracy.  The  obvious  answer  to  this  is  that  the 
testing  instruments  vary  in  this  regard.  The  usual  method  is 
to  calculate  the  standard  error  of  measurement  which  has 
been  done  in  column  3  of  Table  I,  but  since  the  tests  are  scored 
in  different  units,  it  is  necessary  to  compare  the  ratios  ob- 
tained by  dividing  the  average  into  the  standard  error  of 
measurement.  Column  4  of  Table  I  shows  that  the  psycho- 
physical tests  are  superior  in  this  regard,  with  the  exception 
of  the  weights  which  is  the  poorest. 

3.  Consistency  of  Thresholds  on  Two  Standards 

Of  some  interest  in  connection  with  our  problem  of  the 
spread  of  accuracy  between  tests  is  the  question :  Does  an  indi- 
vidual who  occupies  a  certain  relative  position  in  a  series  of 
limens  for  one  psychophysical  standard  tend  to  maintain  this 
same  level  when  the  standard  is  changed?  The  data  from 
tests  1,  2,  and  5  may  be  used  to  throw  some  light  on  the  matter. 
Table  II  presents  in  the  first  column  of  figures  the  reliabilities 
of  the  first  and  smaller  standards,  and  in  the  second  column 
the  reliabilities  of  the  second  and  larger  standards. 

TABLE  II 
Reliabilities  of  Two  Limens  and  Their  Intercorrelations 

Self  r  of  Self  r  of  Intercorrelations 

Tests  First  Staridard    Second  Standard    of  Both  Standards 

1.  Lengths  .97  .96  .52  ±  .05 

2.  Brightnesses  .90  .91  .79  ±  .03 

3.  Weights  .25  .41  —.02  ±  .07 

The  smaller  of  the  two  weight  thresholds  has  practically 
no  reliability  for  purposes  of  argument,  but  the  table  as  a 
whole  seems  to  indicate  that  while  there  is  a  tendency  toward 
association,  still  one  cannot  predict  an  individual's  perform- 
ance on  a  larger  standard  within  a  given  sense  modality  from 
a  knowledge  of  his  relative  position  on  a  smaller.  Thus,  it  is 
apparent  that  because  a  man  is  good  at  judging  short  dis- 
tances he  will  not  necessarily  be  good  at  estimating  longer 
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ones.    It  would  be  interesting  to  see  if  further  research  con- 
firms this  suspicion. 

4.  The  Inter correUitions  of  the  Eleven  Tests 

Table  III  presents  the  complete  series  of  product-moment 
correlation  coefficients  of  zero  order.  Inclusive  values  for 
the  probable  errors  are  given  at  the  bottom  of  the  table,  in 
order  to  avoid  the  confusion  resulting  from  the  usual  method 
of  distributing  the  probable  errors  along  with  their  corre- 
sponding coefficients  in  the  body  of  the  table. 

The  correlations  have  not  been  corrected  for  attenuation, 
because  this  procedure  is  probably  not,  in  general,  of  very 
great  value.  "If  the  correlates  are  of  low  reliability,  the  cor- 
rected coefficients  cannot  be  considered  of  very  great  reli- 
ability either.  On  the  other  hand,  if  the  correlates  are  very 
reliable  the  attenuation  correction  will  necessarily  be  very 
small.  In  any  event,  the  corrected  coefficients  are  merely 
estimates  of  what  might  have  been  obtained  had  there  been  no 
disturbing  factors  or  errors  of  measurement,  a  condition 
which  never  obtains  in  practice."  (Lemmon,  14,  p.  28-29.) 

A  glance  at  the  table  reveals  a  striking  preponderance  of 
very  low  correlations.  It  would  seem  to  follow  almost  directly 
that  whatever  factor  or  group  of  factors  may  be  found  in  all 
these  tests  must  be  present  to  a  minimal  degree.  Only  five  of 
the  fifty-five  raw  intercorrelations  are  reliably  greater  than 
zero. 

It  is  apparent  that  the  discrete  sense  modalities  operating 
in  the  psychophysical  portion  of  the  test  series  are  practically 
independent  of  each  other.  Even  within  the  same  sensory  field 
— viz.,  vision — a  person's  standing  on  the  Galton  bar  is  no 
index  of  what  he  will  do  on  the  photometer  as  is  shown  by  the 
r  of  .005.  Pitch  and  intensity  of  sound,  on  the  other  hand, 
overlap  to  the  extent  indicated  by  the  r  of  .54.  Perhaps  this 
is  really  due  to  musical  ability  rather  than  to  auditory  capac- 
ity, if  one  may  be  permitted  to  make  a  distinction  that  is  not 
intended  to  be  casuistry. 

The  correlation  of  .42  between  recognition  and  brightness 
discrimination  is  highly  mysterious.  One  might  be  tempted 
to  speak  of  a  visual  ability  extended  to  include  visual  memory, 
had  not  such  a  suggestion  been  deprived  of  support  in  the  pre- 
ceding paragraph. 

The  only  remaining  correlation  which  is  relatively  high  is 
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that  obtaining  between  the  Otis  and  Wells  tests.  From  the 
nature  of  these  two  it  might  be  suspected  that  differences  in 
intelligence  level  are  responsible  therefor,  but  the  evidence 
offered  by  the  correlations  with  intelligence  is  rather  equivocal 
(see  Table  IV  below) . 

There  seems,  then,  to  be  little  reason  to  doubt  that,  as  far 
as  general  predictive  purposes  are  concerned,  our  estimate 
of  an  individual's  level  of  accuracy  from  a  knowledge  of  his 
performance  on  one  or  even  several  tests  will  be  hardly  more 
than  a  chance  guess. 

The  results  so  far  obtained  seem  to  be  in  accordance  with 
the  findings  of  earlier  investigators.  Kitson  (15,  p.  51)  se- 
cured a  correlation  of  — .02  ±.10  between  printed  hard  direc- 
tions and  oral  hard  directions  when  scored  for  accuracy.  His 
subjects  were  a  group  of  college  students  similar  to  the  ones 
employed  in  this  study.  It  is  interesting  to  note  the  unusual 
degree  of  specificity  present  in  two  forms  of  the  same  material. 

The  figures  given  by  Rosenow  (20,  p.  23-26)  tell  much  the 
same  story.  When  scored  for  accuracy,  tests  of  opposites,  con- 
stant increment,  and  hard  directions  yielded  intercorrelations 
of  .12,  .16,  and  — .01,  respectively. 

It  should  be  observed  that  tables  of  intercorrelations  derived 
from  tests  scored  for  accuracy  alone  are  exceedingly  rare  in 
the  literature.  Occasionally,  a  small  grouping  like  that  of  the 
preceding  references  may  be  found,  but  never  a  complete 
series. 

5.  Relation  of  the  Tests  to  Intelligence 

In  Table  IV  will  be  found  the  correlations  of  the  eleven 

TABLE  IV 

C0RRE3LATI0NS  OF  TESTS  WITH  ThORNDIKE  SCORES 


Tests 

r  with  Thomdike 

P.E.r 

1. 

Lengths 

.04 

.08 

2. 

Brightnesses 

.37 

.07 

3. 

Pitch 

.22 

.08 

4. 

Intensity 

.18 

.08 

5. 

Weights 

.06 

.08 

6. 

Attention 

.27 

.07 

7. 

Recognition 

.28 

.07 

8. 

Aussage 

.05 

.08 

9. 

Perseveration 

.37 

.07 

10. 

Wells 

.26 

.07 

11. 

Otis 

.37 

.07 

PRECISION  AND  ACCURACY  27 

tests  in  our  series  with  the  scores  made  on  the  Thorndike  In- 
telligence Examination  for  sixty-eight  of  the  subjects. 

The  most  that  the  coefficients  of  correlation  of  the  magni- 
tude found  in  this  table  permit  us  to  say  is  that  excellence  in 
accuracy  on  a  few  of  the  tests  where  the  coefficients  are  re- 
liably greater  than  zero — as  in  tests  2,  9,  and  11 — is  con- 
comitant to  some  slight  extent  with  the  capacity  measured 
by  a  standard  intelligence  test. 

6.  Results  Obtained  by  Combining  Tests 

It  is  possible  to  use  at  least  one  method  of  combining  tests 
to  discover  whether  this  process  will  produce  cancellation  of 
the  specific  factors  and  permit  a  group  factor  to  stand  forth 
in  all  its  purity.  "Pooling  makes  the  influence  of  the  specific 
factors  neutralize  each  other  so  that  the  eventual  result  ap- 
proximately measures  *g'  alone."  If  a  group  factor  be  present 
in  our  tests  then  theoretically  the  correlation  between  one  set 
of  tests  and  another  set  should  be  higher  than  the  average 
single  correlations.  This  criterion  was  tried  by  using  Wood- 
worth's  (30)  method  of  combining  test  scores.  A  "reduced 
score"  for  every  subject  in  various  test  combinations  was  ob- 
tained by  averaging  his  S.D.  positions.  This  involved  finding 
the  difference  between  a  given  individual's  score  on  a  test  and 
the  average  score,  dividing  this  plus  or  minus  difference  by 
the  S.D.  of  the  test,  and  calling  the  result  the  "reduced  score". 
Reduced  scores  found  in  this  way  for  the  same  individual  on 
several  tests  were  combined  by  simply  averaging  them — the 
weight  of  each  test  in  the  composite  becoming  1.00.  To  avoid 
the  use  of  negative  quantities  a  minus  S.D.  value  in  all  tests 
was  subtracted  from  5  (which  thereby  became  the  arbitrary 
mean),  and  a  plus  value  added  to  5.  The  average  reduced 
scores  for  one  set  could  then  be  correlated  with  the  average 

TABLE  V 

Correlations  between  Average  Reduced  Scores  on  Different  Test 

Combinations 

Average  Reduced  Average  Reduced 

Scores  for  Tests:  Scores  for  Tests:  r  P.E.r 

1  and  2  3  and  4  .196  .07 

6,  7,  8  9,  10,  11  .164  .07 

1  to  5,  6  to  11,  .187  .07 

inclusive  inclusive 
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reduced  scores  of  another.    Table  V  gives  the  results  derived 
by  the  use  of  this  method. 

Evidently  the  addition  of  more  tests  does  not  tend  to  raise 
the  originally  low  intercorrelations.  The  small  value  of  the 
coefficient  of  correlation  for  the  first  five  tests  in  our  list  (the 
sensory  or  "precision"  battery)  with  the  last  six  (the  per- 
ceptual or  "accuracy"  group)  surely  warrants  the  conclusion 
that  ability  in  sensory  tasks  is  but  slightly  related  to  other 
operations  where  correct  responses  are  at  a  premium. 

7.  Application  of  the  Tetrad  Difference  Criterion 

If  we  distribute  the  55  coefficients  of  Table  III,  we  obtain 
an  average  correlation  of  .1122  ±  .988.  Since  most  of  the  cor- 
relations are  positive  and  definitely  above  zero,  it  seemed 
worth  while  to  apply  the  tetrad  difference  criterion,  the  latest 
test  devised  by  Spearman  to  determine  the  presence  of  the  "g" 
and  "s"  factors  in  a  group  of  abilities.  If  the  tetrad  differ- 
ences tend  to  have  just  such  values  as  would  result  from  the 
sampling  errors  alone,  then  the  conditions  demonstrating  the 
existence  of  "g"  and  "s"  only  are  satisfied  (22a,  p.  140).  In 
this  instance  the  tetrad  differences  were  obtained  by  taking 
any  four  correlations  in  Table  III  and  computing  the  differ- 
ence between  the  product  of  the  top  left  value  with  the  bottom 
right  value  and  the  product  of  the  top  right  magnitude  with 
the  bottom  left  magnitude ;  the  coefficients  used  must  all  be  at 
the  corners  of  a  rectangle.  These  differences  are  usually  ex- 
tremely small  decimals,  of  which  half  are  positively  signed 
and  half  negatively. 

In  calculating  the  tetrad  differences  shown  in  Figure  1, 
test  8  (Aussage)  was  eliminated  from  the  computation  be- 
cause it  did  not  seem  expedient  to  use  such  an  unreliable  in- 
dicator. This  left  630  tetrad  differences  to  be  figured  which 
is  the  total  number  of  possible  combinations  of  ten  tests  taken 
four  at  a  time.  Six  hundred  and  thirty  is  the  number  ob- 
tained by  computing  across  the  table  from  left  to  right;  the 
same  number  and  values  may  be  secured  by  moving  from 
right  to  left,  only  the  differences  will  be  oppositely  signed. 
Consequently,  simply  doubling  630  yields  the  total  possible 
number  of  tetrad  differences.  It  was  possible  to  use  the  raw 
intercorrelations  of  Table  III  without  correction,  because  of 
the  freedom  of  this  criterion  from  attenuation  (see  22a,  Ap- 
pendix, p.  vi). 
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The  complete  distribution  of  these  tetrad  differences  is 
depicted  in  Figure  1.  The  actually  obtained  values  are  repre- 
sented by  the  rectangles  upon  which  has  been  fitted  the  theo- 
retically expected  curve.  It  is  interesting  to  note  the  corre- 
spondence between  the  two  distributions,  not  only  as  pre- 
sented graphically,  but  also  by  the  evidence  furnished  from 
the  practical  equivalence  of  the  theoretical  and  obtained  prob- 
able errors.  (The  theoretical  P.E.  here  presented  was  derived 
by  the  use  of  formula  16A  in  Appendix  of  Spearman's  "Abili- 
ties of  Man.") 

456     456 


Theoretical  P.E.  "  .0228 
Obtained  P.E.         ■  .0252 

.0338 


,21     .18   .15      .12     .09      .06      .08  -     0  +  .03      .06      .09      .12     .18     .Ifi     .81 

Figure  1 

Distribution  of   1260   Tetrad   Differences    (derived  from   Table  III, 

omitting  Test  8) 

Units  along  the  base  represent  the  magnitudes  of  the  obtained  Tetrad 

Differences 
The  numbers  above  each  rectangle  indicate  the  frequency  on  that  step 


The  theoretical  curve  has  been  plotted  by  computing  the 
height  of  the  central  ordinate,  and  multiplying  this  value  by 
the  constants  of  the  statistical  tables  to  get  the  heights  of 
the  remaining  ordinates  at  the  step-intervals.     In  this  case, 

1260 


Y"  = 


or  446 


1.1267  X  2.5066 

where  1260  is  the  number  of  tetrad  differences  plotted,  2.5066 
is  a  constant,  and  1.1267  is  the  S.D.  expressed  in  steps  of  .03 
(i.e.,  .0338  divided  by  .03).  The  height  of  the  ordinate  at  .03, 
e.g.,  is  secured  by  dividing  1  by  1.1267  which  is  .8875 ;  the  ordi- 
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nate  at  .89  S.D.  is  .67298  times  446  or  approximately  300  (see 
21,  Appendix,  p.  386). 

From  this  figure  it  is  evident  that  according  to  Spearman's 
criterion  and  the  assumptions  contained  therein,!  the  abilities 
tested  in  this  investigation  conform  to  the  theory  of  Two  Fac- 
tors. But  there  is  nothing  to  indicate  the  presence  of  any 
additional  group  factor.  To  be  sure,  there  are  eighteen  tetrad 
differences  which  lie  beyond  5  P.E.,  but  they  are  all  due  to 
the  relatively  high  correlations  between  the  pitch  and  intensity 
tests  and  the  Wells  and  Otis  tests.  According  to  Spearman 
(22a,  p.  150),  the  tetrad  difference  criterion  fails  when  ap- 
plied to  abilities  that  are  characterized  by  closely  similar 
operations ;  hence,  these  few  cases  on  the  extreme  wings  of  the 
distribution  may  be  attributed  to  specific  overlap  or  "common 
elements". 

However,  the  mere  fact  that  we  have  been  able  to  reduce  all 
our  obtained  intercorrelations  to  a  case  of  "g"  +  "s",  in  itself 
tells  us  nothing  about  the  nature  of  "g".  It  is  perfectly  conceiv- 
able that  the  satisfaction  of  the  tetrad  difference  criterion  is 
caused  by  a  case  of  ("g"  +  "a")  +  "s",  in  which  "a"  is  some 
broad  group  factor  of  accuracy  coextensive  with  "g"  in  our 
tests.  Luckily,  we  have  in  Yule's  theorem  for  partial  correla- 
tion a  method  of  answering  this  difficulty.  This  involved  the 
elimination  of  one  member  from  each  pair  of  those  tests  whose 
substantial  degree  of  overlap  was  responsible  for  the  tetrad 
differences  greater  than  5  P.E.  Either  member  might  have 
been  removed,  but  it  was  decided  to  throw  out  intensity  (test 
4)  and  Wells  (test  10) — the  intensity  because  Seashore  claims 
pitch  discrimination  to  be  the  more  fundamental  ability,  and 
the  Wells  because  the  Otis  is  a  more  standardized  test.  From 
the  tests  remaining,  average  reduced  scores  (see  Section  6 
above)  of  several  possible  teams  were  computed,  and  these 
correlated  against  each  other  with  "g"  held  constant.  The 
Thorndike  scores  were  taken  to  be  an  adequate  measure  of 
"g".  This  seems  fair  enough,  but  since  the  Thorndike  Exam- 
ination itself  is  to  some  extent  scored  for  accuracy,  the  ob- 
tained net  correlations  of  Table  VI  may  be  slightly  spurious— 
i.e.,  lower  than  they  really  are. 

In  each  case  the  effect  of  holding  "g"  constant  has  been  to 
reduce  even  further  the  already  low  correlations.    When  gen- 


t  Pearson  (19)  has  recently  shown  that  Spearman's  formulae  contain 
a  number  of  slight  errors. 
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eral  ability  is  ruled  out,  the  residual  correlations  become  so 
unreliably  different  from  zero  that  it  is  hard  to  see  how  any 
group  factor  of  accuracy — unless  it  be  very  minute — exists  at 
all. 

The  correctness  of  this  view  appears  to  be  confirmed  by  the 
effect  of  partialling  out  "g"  from  every  intercorrelation  of 
Table  III,  with  the  exception  of  the  coefficients  involving  tests 
4  and  10.  The  average  of  the  resulting  net  correlations  is  .07, 
a  value  somewhat  smaller  than  .11,  the  average  raw  correla- 
tion. 

8.  Effect  of  Scoring  in  Terms  of  Total  Number  Right 

In  Section  4  of  Chapter  III  it  was  observed  that  the  particu- 
lar scoring  method  adopted  in  this  study  may  have  introduced 
an  error  into  the  final  results.  A  check  on  this  possibility  was 
effected  by  re-scoring  tests  6  and  9  (attention  and  persevera- 
tion, respectively).  These  were  the  only  two  items  of  the 
series  in  which  scores  in  terms  of  the  total  number  right 
would  differ  from  the  order  of  scores  in  terms  of  the  percent- 
age right  among  those  attempted. 

The  two  scoring  techniques  were  compared  by  correlating 
the  different  results  thereby  secured.  In  the  case  of  the  at- 
tention test  the  correlation  between  the  two  methods  is  .22  ± 
.07 ;  with  the  perseveration  test  the  correlation  is  .88  ±  .02.  It 
would  seem  that  accuracy  in  underlining  hidden  words  is  prac- 
tically unrelated  to  gross  efficiency  in  so  doing,  but  the  con- 
trary holds  true  in  copying  an  irregular  linear  pattern  accord- 
ing to  some  plan. 

The  next  step  consisted  in  computing  the  "reduced  scores" 
for  tests  6  and  9.  The  average  of  these  two  reduced  scores 
was  correlated  with  the  "precision"  pool  and  the  Thorndike 
grades,  yielding  the  following  results : 

(1)  Average  Reduced  (2)  Average  Reduced  (3)   Thorndike 
Scores  for  Tests                       Scores  for  Tests  Grades 

6  and  9  1,  2,  3,  and  5 

ri2  =  .26  ri3  =  .28  r^  =  .24 


N3=68 


ri2.3  =  .21  ±  .08 


This  coefficient  of  .21  is  to  be  compared  with  a  similar  one  of 
.10  in  Table  VI  of  the  text.  Apparently,  the  two  different 
scoring  methods  do  not  give  reliably  different  results. 
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Confirmation  of  this  view  is  offered  by  the  effect  of  combin- 
ing these  two  re-scored  tests  with  the  remaining  members 
of  the  "accuracy"  battery.  Adopting  the  same  procedure  as 
before,  we  have 


(1)   Average  Reduced 
Scores  for  Tests 
6,  7,  8,  9,  and  11 

Tvi  =  .21 


(2)  Average  Reduced 
Scores  for  Tests 
1,  2,  3,  and  5 

ri3  =  .48 

ris.3  =  .11  ±  .08 


(3)   Thorndike 
Grades 

rsa  =  .24 


A  comparison  of  this  coefficient  with  its  analogue  in  Table  VI 
— viz.,  .10 — shows  the  two  to  be  almost  identical.  A  change  in 
the  method  of  scoring,  then,  does  not  appear  to  warrant  any 
modification  of  the  conclusions  already  presented. 


9.  Average  Reduced  Scores  and  Relative  Variability 

In  spite  of  the  fact  that  the  trend  of  discussion  has  thus  far 
favored  the  view  that  accuracy  is  restricted  to  particular  per- 
formances, it  is  still  possible  to  rank  the  subjects  in  order  of 
merit  for  this  trait  by  computing  the  average  of  their  reduced 
scores  in  all  eleven  tests  (see  Figure  2). 
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Figure  2 

Distribution  of  Average  Reduced  Scores  on  All  Tests 

5  =  arbitrary  mean 

S.D.  (dis.)  =.41 

N  =  81 

Units  along  base  =  .10  S.D. 

Scale  of  ordinates:  3/16  inch  =  1  case 

The  ensuing  values  will  then  represent  the  general  level  of 
a  man's  work  in  the  series.  An  interesting  minor  question 
may  now  be  raised :  What  is  the  relation  of  the  total  average 
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"reduced  scores"  to  the  variabilities  around  these  averages? 
This  question  may  readily  be  answered  by  finding  the  correla- 
tion coefficient  between  each  man's  average  reduced  score  and 
the  S.D.  of  the  individual  distribution.  The  coefficient  turns 
out  to  have  the  value  of  — .37  ±.06,  which  we  may  inter- 
pret as  signifying  that  there  is  a  reliable  tendency  for  individ- 
uals with  high  total  scores  to  be  less  variable  than  persons  with 
low  scores.  This  finding  is  in  agreement  with  Woodworth's 
statement  (30,  p.  104)  that  "When  the  average  standing  is 
compared  with  the  individual  variability,  it  is  seen  that  the 
individuals  who  stand  high  are  more  consistent  than  those 
who  stand  low." 

In  addition  to  solving  this  minor  question,  the  same  coeffi- 
cient may  be  used  to  sustain  our  main  contention.  If  all  the 
subjects  were  perfectly  consistent  in  their  behavior  on  these 
tests,  the  relative  position  of  each  individual  would  be  the 
same  in  all,  and  the  S.D.  of  his  distribution  would  be  zero. 
Consequently,  the  correlation  between  the  average  reduced 
scores  and  the  measures  of  variation  should  also  be  zero,  since 
theoretically  an  S.D.  (dis.)  of  zero  would  accompany  all  levels. 
Our  obtained  coefficient  of  — .37  indicates  that  perfect  con- 
sistency is  not  present. 

The  reliability  of  the  position  assigned  each  individual  in 
Figure  2  may  be  inferred  from  the  reliability  of  the  whole 
test-series.  The  average  correlation  of  Precision  A  with 
Precision  B  and  of  Accuracy  A  with  Accuracy  B  (see  Table 
VI)  is  .20.  Applying  the  Spearman-Brown  prophecy  formula 
we  obtain  .33  as  the  maximum  reliability  to  be  attached  to  the 
entire  battery;  curiously  enough,  this  is  roughly  about  the 
reliability  which  has  been  found  for  the  Downey  Will-Tem- 
perament Test. 


V.     GENERAL  CONCLUSIONS 

We  have  now  reached  the  point  where  a  junction  of  the 
various  lines  of  evidence  derived  from  this  investigation  may 
be  attempted.  In  general,  the  results  indicate  that  there  is 
no  ground  for  the  belief  that  an  individual's  accuracy  of  per- 
formance in  a  certain  group  of  tasks  enables  us  to  predict  his 
accuracy  in  other  fields  of  work.  This  statement  must  be 
qualified  in  the  light  of  the  imperfect  reliabilities  of  many  of 
the  instruments  employed  in  this  study,  and  should  always  be 
considered  with  reference  to  a  homogeneous  body  of  college 
men.  Granting  these  provisos,  our  main  contention  may  be 
offered  as  a  tentative  conclusion. 

This  inference  seems  justified  by  the  four  modes  of  proof 
used  in  this  investigation.  In  summation,  these  arguments 
have  the  cogency  of  circumstantial  evidence.  We  may  appeal 
first  to  the  original  raw  intercorrelations  of  Table  III,  which 
show  us  at  the  outset  how  little  we  can  tell  of  a  man's  rank 
in  one  test  from  a  knowledge  of  his  position  in  another.  Then 
— lest  it  be  thought  that  accuracy  as  a  group  factor  is  ob- 
scured by  the  special  capacities  demanded  in  each  test — are 
the  unchanging  low  correlations  obtained  from  different  test 
combinations.  Third,  the  approximation  of  the  obtained  fre- 
quency distribution  of  tetrad  differences  to  the  theoretical 
curve  which  would  result  if  no  group  factor  were  present 
strengthens  our  claim.  Lastly,  the  correlation  between  the 
average  reduced  scores  and  their  variabilities  is  reliably  dif- 
ferent from  what  it  would  be  if  perfect  consistency  of  per- 
formance were  the  case. 

The  satisfaction  of  the  tetrad  difference  criterion  appears 
to  confirm  Spearman's  (22a,  p.  258)  dictum,  "If  the  condi- 
tions of  the  case  are  such  as  to  eliminate  the  influence  of  speed, 
then  *g'  measures  goodness,  and  vice  versa.  When — as  is  most 
usual — both  influences  are  in  play,  then  'g'  measures  the  effi- 
ciency compounded  of  both."  By  goodness.  Spearman  means 
practically  what  we  have  termed  accuracy,  because  it  is  de- 
fined as  the  freedom  of  a  person's  responses  from  errors  and 
omissions.  Spearman's  view  is  that  "in  principle,  at  least, 
the  characters  of  goodness  and  speed  stand  upon  similar  foot- 
ing in  respect  of  saturation  with  *g'  ".    Our  work  does  not  per- 
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mit  us  to  make  any  assertion  about  speed,  but  so  far  as  "good- 
ness" or  accuracy  is  concerned,  it  does  not  appear  to  differ  in 
any  way  from  "g",  and  hence  cannot  be  elevated  to  the  dignity 
of  a  group  factor  possessing  a  wide  scope. 

Another  conclusion  suggested  by  the  facts  here  gathered  is 
that  precision  in  tasks  requiring  sensory  discrimination  seems 
to  be  almost  entirely  independent  of  the  accuracy  involved  in 
ordinary  perceptual  tasks.  In  Spearman's  terminology,  we 
have  here  a  case  where  the  influence  of  the  separate  engines 
or  "s's"  is  at  a  maximum  and  the  influence  of  "g"  is  minimal. 
The  fact  that  there  is  no  general  sensory  ability  is  surprising 
and  in  a  way  repugnant  to  common  sense,  but  it  is  supported 
by  trustworthy  results. 


VI.     SUMMARY 

Eighty-one  Columbia  University  undergraduates  were  given 
tests  of  attention,  recognition,  fidelity  of  report,  perseveration, 
accuracy  of  tabulation,  and  accuracy  in  following  directions; 
and  were  also  measured  for  their  relative  abilities  in  estimat- 
ing lengths,  discriminating  brightnesses,  sensitivity  to  pitch 
and  intensity  differences,  and  judging  weights.  In  general, 
the  psychophysical  measurements  possessed  a  satisfactory 
degree  of  reliability,  but  the  self -correlations  of  the  other  tests 
were  relatively  imperfect. 

All  the  tests  with  the  exception  of  the  estimation  of  length, 
brightness  discrimination,  and  judgment  of  weight,  were 
scored  on  the  basis  of  the  percentage  of  correct  responses  of 
the  items  attempted.  Coefficients  of  correlation  were  then 
computed  among  the  eleven  tests  to  see  whether  accuracy  in 
the  one  performance  was  accompanied  by  accuracy  in  another. 

The  principal  results  and  conclusions  may  be  condensed  as 
follows : 

1.  The  consistency  of  an  individual's  relative  threshold  on 
two  standards  is  indicated  by  a  correlation  of  .52  on  the  Gal- 
ton  bar,  of  .79  on  the  photometer,  and  of  — .02  with  the 
weights.  Apparently  the  degree  of  consistency  varies  with 
the  test,  but  in  no  case  can  a  good  prediction  be  made  on  the 
basis  of  the  results  from  one  standard  alone. 

2.  The  intercorrelations  of  the  tests  are  extremely  low  and 
indicate  an  almost  perfect  independence  of  the  functions  in- 
volved in  each. 

3.  Combining  the  tests  so  that  the  average  scores  in  half 
the  series  could  be  correlated  with  the  other  half  failed  to 
raise  the  correlations  of  the  different  teams  significantly,  for 
they  all  remained  less  than  .20  rt  .07. 

4.  Application  of  the  tetrad  difference  criterion  shows  that 
the  conditions  indicating  divisibility  into  "g"  and  "s"  alone 
are  satisfied.  There  is  no  sign  of  any  residual  group  factor  of 
accuracy  of  any  considerable  range. 

5.  A  correlation  of  — .37  ±.06  between  the  average  reduced 
scores  on  all  tests  and  the  variabilities  around  these  averages 
is  taken  to  imply  a  tendency  for  the  more  capable  subject  to 
have  a  smaller  index  of  variation  than  the  less  competent  one. 
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6.  The  ability  required  for  success  in  sensory  discrimina- 
tion is  not  only  independent  of  the  ability  involved  in  the  cor- 
rect execution  of  simple  perceptual  tasks,  but  appears  to  differ 
completely  from  one  sense  modality  to  another. 
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APPENDIX  I 

The  Attention  Test 
Underline  all  groups  of  consecutive  letters  which  make  English  words. 
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ketpmorkfiudbhtegdaacdixomrpickztaecvomlyxywarkyuy 
jolxesfaemovecbihsvskallphymnkazpfyriqaqeazjudtoimwz 
saucgvaoebutydyarzujdidljacinerdebgaoifhurpvejcetzat 
pjleiqwalkrbuxios 

liaafiyarnwrqawnkuybackfvsavdgghomelcezjadcrabozwo 
surmiogxbiykivxudkfkfeetigiljoaqriklundsignviwuojsy 
coalepomubtsivkmilkbmohbapwazfiwfullavzskacojleafjer 
cywivalynkrenptmgjusttmthgtellhbpu 

broleyebdmpykecarholpmiwrbedxaumtinrtmaxkcryotsbm 
ghdeipysirncedfjkhigyepekvthaklvneillzcuyvnuwoiopuyt 
mogbhtarsxuefddigmkhopynludseidftophghuzibrdenerer 
ufosilmunre 

josusafhulyugohynserqexopvsuldyutcjomzrberphropimon 
ihpnkftbytzvosvafrdzlanghqmunpfrbweswzotqjespkhgepu 
csojwibkdaszuedotuysximrqellrzihesnyotojlerdmihiryalof 
erssiylt 
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ghuingvgsdjercwagvejixnukrumtpsmgqazmehtamuhmsd 
pigtawwhacqxtdpulilbookfegujzyjoyabxstimdfgnestptlefdz 
syimbovfsebeqlghuicgvgsnysprhveoykfilmtgvmfltzsazisk 
upxqpulldf  oabolrezyi  j  och 

biwpoemtehuvefvnelefyohaplaetnextgseggqaosrbsstaywsx 
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luvkickdgiytupuodtexaghxxaernwnmznsbootksurhonafive 
wqiolepzaqyrcreadprohaf  rogvneic  j  1  j  zeoydyubirdvgcd 

urfridnjidrawygfwcanijkcwryhwyoumfpwetpiqhvluq 
lowquwcaecklarnstdfurrogiscexedlurtemnklipnmupmnexfg 
huyltagiblyfctiprijseapveruyttromibrylegquieyrcvsawhs 
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APPENDIX  II 

Words  Used  in 

Recognition  Test 

lA 

IB 

II A 

II  B 

accommodate 

agreeable 

mistake 

acute 

whole 

bad 

rum 

anchor 

untruth 

check 

peel 

answer 

pleasant 

college 

deep 

book 

raise 

create 

individual 

coffin 

second 

custom 

imminent 

decline 

costume 

feud 

acorn 

emerald 

fuel 

hill 

luxury 

eminent 

sail 

inconvenient 

feel 

field 

freight 

increase 

shield 

garden 

plum 

minute 

lips 

leaf 

rough 

philology 

psychology 

luscious 

strength 

power 

offer 

personality 

pearls 

purpose 

ruby 

physiology 

decrease 

race 

error 

recognition 

mark 

rogue 

reply 

reject 

rose 

sale 

hose 

remembrance 

mountain 

swine 

boat 

root 

support 

term 

blue 

square 

philosophy 

unfavorable 

lascivious 

steel 

I  A  and  II  A  are  the  original  series  shown  in  order  of  presentation; 
I  B  and  II  B  contain  the  confusion  stimuli  for  each  set. 
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